PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre10.g441300.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family AP2
Protein Properties Length: 1562aa    MW: 153284 Da    PI: 5.278
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre10.g441300.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP231.93.3e-10859900147
                 AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaai 47 
                         s y+GV wd+k ++W+A+I +   n  +   +lg+++t eeAa+a +
  Cre10.g441300.t1.1 859 SVYRGVVWDEKENKWRAQIVE---N--NGINYLGYYDTQEEAARAFD 900
                         78****************655...4..37889************987 PP

2AP236.31.4e-11945996155
                 AP2   1 sgykGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55 
                         s+ykGV+w++   +WvA ++d       ++  ++g++ ++e+Aa+a+++ ++++ g
  Cre10.g441300.t1.1 945 SQYKGVSWNSACSKWVAVLWD----ReLKRARHIGSYESEEDAARAYDKEALRMLG 996
                         78*******************....3344888*********************987 PP

3AP243.11e-1310311080155
                 AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                          s+y+GV+w+++++rW+++  ++  +   k++ +g+f  + eAa+a+++a ++l+g
  Cre10.g441300.t1.1 1031 SQYRGVSWHERSQRWEVR--VW-GG--GKQHFIGSFTEEVEAARAYDRAVLRLRG 1080
                          78**************77..55.22..4**********99*************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5103211.706625681IPR001471AP2/ERF domain
SuperFamilySSF541711.77E-8627682IPR016177DNA-binding domain
Gene3DG3DSA:3.30.730.101.4E-9628682IPR001471AP2/ERF domain
SMARTSM003806.0E-5628687IPR001471AP2/ERF domain
SuperFamilySSF541712.16E-10859915IPR016177DNA-binding domain
PfamPF008479.8E-7859900IPR001471AP2/ERF domain
Gene3DG3DSA:3.30.730.107.2E-10860915IPR001471AP2/ERF domain
PROSITE profilePS5103211.772860915IPR001471AP2/ERF domain
SMARTSM003804.4E-5860921IPR001471AP2/ERF domain
PfamPF008472.3E-5945996IPR001471AP2/ERF domain
CDDcd000186.51E-149451006No hitNo description
SuperFamilySSF541712.88E-129451005IPR016177DNA-binding domain
Gene3DG3DSA:3.30.730.107.6E-139461006IPR001471AP2/ERF domain
SMARTSM003806.5E-129461010IPR001471AP2/ERF domain
PROSITE profilePS5103213.2749461004IPR001471AP2/ERF domain
CDDcd000185.05E-1110311094No hitNo description
SuperFamilySSF541712.29E-1110311093IPR016177DNA-binding domain
PfamPF008473.2E-610311080IPR001471AP2/ERF domain
PROSITE profilePS5103214.43410321092IPR001471AP2/ERF domain
Gene3DG3DSA:3.30.730.101.9E-1010321093IPR001471AP2/ERF domain
SMARTSM003803.3E-1010321098IPR001471AP2/ERF domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1562 aa     Download sequence    Send to blast
MSAQAQPAGQ GPGKKPLHVA IPGDGGAGGG GALDTPLDFQ SPSFESLGLF SLGPNTGRQS  60
QSGNEIAGSL LSPFGNGLQD LDLLISPLLT NGAAGGFNAE GGPGGQPGQG SVPGQGAAHQ  120
QLPGTQGAGA STSGQGQAGG TAGEAGPGPG SSAAAAVASL AGRDPAAAAA AIRKAMGPPS  180
TISAQMQQKL GLSGGAGGSS TDGDKKSGRA GGLPRPPPLT IPADVPSNII TGINAIGGAT  240
GMSPLPSILN SLNSPLLSAS DNAQLQAMFQ QLAQSTRGGQ TPGLLGALTP GTAGAWPDST  300
KSGGGTLRQL LEMQDGLGDL GLGDMSDIDE AALADALFSK GFLGPLSARR TSITSGAGAG  360
GGRDAGGSDG DGAGGSGGAG GAGAGELSAE QLASMGLQSP SALGMGGNGH VLYKGVHYDK  420
DQDMWQVVVF DGSRCTVVGE YTNELEALVA NELLSPAQPM PGNLNTLLAG GNAGAGGSQG  480
GAGPSGAAAG GDSGVANGAG GHAAGQGGAG GGQGPSGAGN RTPSVTEYLS RLASGNNLLG  540
GPLSPGGAGL SPLFSALSPA GGGGFTALLL PTPREPGNGG AGAHLLPSPT GFGRGGLGQG  600
QDAPPPGAAG QAQGLEGDDD GGQVALHGVE FKPEEGKWAA VINDGEHTEV VGLFDSNIEA  660
ARAYDQEALR RLGPKAELNF PLEALSAAVA GLTGGQPLPL GLPGGGLLDP NLAAAGGFDA  720
VQQAAMALGL TGLASGMQGL EGGVGGGYDE EGDEEGDDDD DHLDLGPLPL AGAFGTAARG  780
RGRGRGRGRG RGRGRGRGRG RGHDDDDEDF VLGSLRNEAS SSGRGRGRGR GRGRGATLTT  840
IMPSQPAPEI IGPDGKKESV YRGVVWDEKE NKWRAQIVEN NGINYLGYYD TQEEAARAFD  900
GAVLRTGSKE LLNFPLVPKA AAPKARGPRG PGKVEGDTRR AKVTSQYKGV SWNSACSKWV  960
AVLWDRELKR ARHIGSYESE EDAARAYDKE ALRMLGPEAG LNFRESAADY LAEIGADGMP  1020
EGSHNSNKGS SQYRGVSWHE RSQRWEVRVW GGGKQHFIGS FTEEVEAARA YDRAVLRLRG  1080
QDARSRSRMN FPLSEYNMDD LGPMPGADAG FLGLMGGLRS TPEPKPKKAQ RKKRGRDDDY  1140
SDSDDDGMPV RGHYGSGGGA AQSAAAANRA AQQQLTAFLQ TALQQQLAAG GGPGGAAGAG  1200
PGGVPGGAGG DAEQQARALA ALTAAAAASG LVLPGLPGMP GGGLLGMLTG LGGSGPGAGA  1260
SPTQGKPGTS PPLPGGLLSA NKAPPPPGMP GQPPPLGPSA QLGGGGMGPG GEEQHMAASY  1320
MGGPEPTDDD GGFHDMGPMG HIIKQEQHVL NLGGPTQPPG SGGSAGLGHK QEPATGMPPS  1380
AAALFMDTSE SPMGKGPAPG PGGLPPGFAA GSDGGSPAPG SGGPVLNLGG LLGPQPHGQA  1440
ARGGHGCGGG AQQFQVSSEL SPSAAAHTAA PPSGKPVLDL GPSPAPPPAG PGVAQPAPAA  1500
AGGGRRGKRG TEAALDLGGP VGGGSGPADQ DAPGSAGAGV VLNLGGTPPA TKRRRGDADA  1560
H*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1823833RGRGRGRGRGR
2825834RGRGRGRGRG
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Cre.124410.0normal conditions| nutrient deficiency
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre10.g441300.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001690353.11e-172hypothetical protein CHLREDRAFT_188358, partial
TrEMBLA8IID21e-172A8IID2_CHLRE; Predicted protein (Fragment)
STRINGEDP056121e-171(Chlamydomonas reinhardtii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP574088
Representative plantOGRP4971784
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G37750.17e-22AP2 family protein